impr(polyglot): Add new structure (wordsMap) to PolyglotWordset to get words from any language by uniform distribution (@IliyaZinoviev) #7440
Pull request overview
Adjusts polyglot word generation so languages are sampled (approximately) uniformly, rather than being dominated by languages with larger dictionaries.
Changes:
- Refactors `PolyglotWordset` to store per-language `Wordset`s (`wordsMap`) and pick a language uniformly per generated word.
- Updates word generation logic to use `PolyglotWordset.currentLanguage` instead of per-word language lookups.
- Small control-flow cleanup in word list retrieval / funbox wordset assignment.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| frontend/src/ts/test/words-generator.ts | Switches polyglot language detection to currentLanguage; minor cleanup around word list returns and funbox wordset assignment. |
| frontend/src/ts/test/funbox/funbox-functions.ts | Reworks PolyglotWordset internals to use wordsMap and uniform language selection. |
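The uniform-sampling behavior described above can be sketched as follows. This is a minimal illustration under assumptions: the `Wordset` base class here is simplified, and only the names `PolyglotWordset`, `wordsMap`, and `currentLanguage` are taken from the PR; the rest is not the actual Monkeytype implementation.

```typescript
// Simplified stand-in for the real Wordset base class (assumption).
class Wordset {
  constructor(public words: string[]) {}
  randomWord(): string {
    return this.words[Math.floor(Math.random() * this.words.length)]!;
  }
}

class PolyglotWordset extends Wordset {
  currentLanguage = "";
  private langs: string[];

  constructor(private wordsMap: Map<string, Wordset>) {
    // Aggregate every language's words so consumers that read `words`
    // directly still see a non-empty list (see review comment below).
    super(Array.from(wordsMap.values()).flatMap((ws) => ws.words));
    this.langs = Array.from(wordsMap.keys());
  }

  override randomWord(): string {
    // Pick the language uniformly first, then a word from that language's
    // wordset, so small dictionaries are sampled as often as large ones.
    this.currentLanguage =
      this.langs[Math.floor(Math.random() * this.langs.length)]!;
    return this.wordsMap.get(this.currentLanguage)!.randomWord();
  }
}
```

The key design point is the two-stage draw: sampling the language before the word makes each language's expected share 1/N regardless of dictionary size, whereas sampling from the aggregated word list would weight languages by word count.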
```ts
super([]);
this.languageProperties = languageProperties;
this.langs = Array.from(languageProperties.keys());
this.wordsMap = wordsMap;
this.resetIndexes();
this.length = Array.from(this.wordsMap.values()).reduce(
  (sum, ws) => sum + ws.words.length,
  0,
);
```
`PolyglotWordset` now calls `super([])` and never populates `this.words`. Code in words-generator relies on `currentWordset.words` (e.g. the repeat-quote path and the hasTab/hasNewline checks), so polyglot wordsets will report an empty word list. Consider keeping `words` as an aggregate of all per-language words (or overriding/adding an accessor and updating consumers to use it).
```suggestion
const aggregatedWords = Array.from(wordsMap.values()).reduce<string[]>(
  (all, ws) => all.concat(ws.words),
  [],
);
super(aggregatedWords);
this.languageProperties = languageProperties;
this.langs = Array.from(languageProperties.keys());
this.wordsMap = wordsMap;
this.resetIndexes();
this.length = aggregatedWords.length;
```
```ts
}

override resetIndexes(): void {
  this.wordsMap.forEach((ws, _) => {
```
`resetIndexes()` uses `this.wordsMap.forEach((ws, _) => ...)` but the key parameter is unused. Drop the second argument to avoid unused-var lint noise.
```suggestion
this.wordsMap.forEach(ws => {
```
Add uniform distribution to polyglot
Motivation
While using polyglot, I noticed that words from languages with huge dictionaries appear constantly, while words from small ones almost never do (at least that is how it looks from a user-experience perspective). This PR solves that issue: each language now contributes roughly equally to each test.
Description
Changed:
Checks
- `packages/schemas/src/languages.ts`
- `frontend/src/ts/constants/languages.ts`
- `frontend/static/languages`
- `packages/schemas/src/themes.ts`
- `frontend/src/ts/constants/themes.ts`
- `frontend/static/themes`
- `packages/schemas/src/layouts.ts`
- `frontend/static/layouts`
- `frontend/static/webfonts`
- `packages/schemas/src/fonts.ts`
- `frontend/src/ts/constants/fonts.ts`

Closes #